ANTI-LEU 3a AMINO ACID SEQUENCE



Field of the Invention 

This invention relates to an amino acid sequence
for an anti-CD4 antibody and more particularly
relates to an amino acid sequence for the anti-CD4
antibody, Anti-Leu 3a, and a chimeric variant
thereof.



Background of the Invention 

CD4 is an antigen on certain T lymphocytes
(i.e., the helper subset) that has a molecular weight
of approximately 55 Kd. It is thought to consist of
four domains (V1 - V4) that extend from the cell
membrane outward in serial fashion.

Recently, the CD4 antigen or CD4+ cells have
been implicated in certain immune system diseases
ranging from autoimmune diseases, such as
rheumatoid arthritis and multiple sclerosis, to AIDS.
Treatment of these diseases with anti-CD4 agents
(i.e., chemical or biological materials that bind to or
block the function of the CD4 antigen) has been
suggested. See, e.g., U.S. Pat. No. 4,695,459 and
Weber et al., Sci. Amer., 259:101 (1988).

Not all anti-CD4 agents, and in particular not all
anti-CD4 monoclonal antibodies, however, have the
same effect on the CD4 antigen or CD4+ cells.
That is, not all anti-CD4 monoclonal antibodies bind
to the same region or epitope on CD4. Where a
particular monoclonal antibody binds to CD4 is
important in a disease such as AIDS, for example.
Sattentau et al., Science, 234:1120 (1986), showed
that not all anti-CD4 antibodies would cross-block
each other in competitive binding studies with
CD4+ cells or would block the binding of HIV to
CD4+ cells. Recent work, summarized by Weber et
al., supra, suggests that a specific amino acid
sequence in the outermost or V1 domain of the
CD4 molecule is the site where HIV binds to CD4,
and suggests that this is the site (or very near to it)
where the monoclonal antibody, Anti-Leu 3a
(Becton Dickinson Immunocytometry Systems,
BDIS), binds. Also near this site, but different from
the site to which Anti-Leu 3a binds, is the site at
which another monoclonal antibody, OKT4a (Ortho
Diagnostics), binds.

The ability of Anti-Leu 3a to block HIV binding
has been attributed to its structure being the same
or nearly the same as a portion of the gp120 region
of the HIV virus. The gp120 region of the virus has
been shown to bind to CD4. Thus, Anti-Leu 3a has
a structure that looks like the HIV binding region on
gp120 but lacks the disease carrying properties of
the virus itself. As a result, the use of Anti-Leu 3a as
a vaccine is being attempted. See Matthews et al.,
Sci. Amer., 259:120 (1988).

Anti-Leu 3a is a mouse monoclonal antibody
derived from clone SK3 which was originally
described by Evans et al., PNAS, 78:544 (1981). SK3
was derived from hybridization of mouse NS-1
myeloma cells with spleens from BALB/c mice
immunized with sheep red blood cell rosettes of
human peripheral blood. It is an IgG1 type antibody
having a kappa light chain. Because Anti-Leu 3a is
a mouse monoclonal antibody, however, its use as a
therapeutic agent, such as an AIDS vaccine, raises
certain problems.

Mouse monoclonal antibodies or in fact any
other non-human antibodies will be immunogenic
when used as a human therapeutic agent. The
mouse monoclonal antibody will be recognized as
being "non-self" by the recipient's immune system.
Thus, once a mouse monoclonal antibody has
been used, anti-mouse antibodies will be formed in
the host which then will limit the effectiveness
of the agent in the course of further or
subsequent treatment. Accordingly, it would be
preferable to use a human monoclonal antibody. The
preparation of a human antibody, however, raises a
number of serious practical and ethical questions.

One alternative method to make the mouse
monoclonal antibody less immunogenic while
retaining the binding specificity of the antibody is to
make the antibody "chimeric." In one embodiment
of this format, the mouse variable region of the
antibody is coupled to the constant region of the
same type antibody but from another species or
another strain of the same species. In the preferred
human therapeutic embodiment, the other species
is human. For veterinary purposes, the species to
be treated will comprise the other species.

Simply stated, the gene for the mouse variable
region is isolated and spliced by appropriate
genetic engineering techniques, such as those
described in chapters 5 and 6 of Recombinant DNA:
A Short Course (Watson et al., eds., 1983), to the
gene for the human constant region. The resulting
gene then will code for an immunoglobulin having
the mouse variable region and human constant
region. A vector carrying this new gene then may
be placed in a prokaryotic or eukaryotic organism
for expression of the immunoglobulin. A specific
embodiment for this type of chimeric antibody for
expression in eukaryotic organisms is set forth in
USSN 644,473 filed August 27, 1984, and for
expression in prokaryotic organisms is set forth in
USSN 483,457 filed April 8, 1983.

The application USSN 644,473 corresponds
with published European Patent Specification
173494. The application USSN 483,457 corresponds
with published US Patent 4816567.

Another embodiment of a chimeric format is
set forth in UK Pat. Appl. GB 2 188 638 filed March
27, 1986, and also is described in Jones et al.,
Nature, 321:522 (1986). In this approach, the
chimeric antibody is "humanized" by introducing
more human sequences into the variable gene
region while retaining the specific mouse binding
regions.

This embodiment of an antibody takes advantage
of the fact that the variable region of both
mouse and human immunoglobulins is comprised
of four framework residues (FRs) which are
interspersed with three complementarity determining
residues (CDRs). The CDRs mediate antigen binding.
Thus, in this approach, the mouse variable
region genes coding for one or more of the CDRs
are spliced into the human framework variable
region and the resulting "mosaic" variable region
may be spliced onto the human constant region
forming a different type of chimeric antibody.
Vectors containing these constructs may be transferred
into expression systems as described above. A
schematic for the traditional and mosaic forms of
chimeric antibodies is shown in FIG. 1.

The ability to manipulate the gene sequences
for any particular monoclonal antibody provides
one further opportunity to design the antibody to
precise specifications. By using appropriate
recombinant DNA techniques, it is possible to create
specific mutations in the nucleic acid sequence of
the variable region gene. In some cases, this will
lead to a change in the amino acid sequence of the
immunoglobulin. This change, in turn, may lead to
a conformational change in the structure of the
immunoglobulin which may improve or diminish the
binding capability or activities of the
immunoglobulin. Thus, it may be possible to design
two different immunoglobulins that have the same
conformation and binding specificity but have
different sequences.
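
The point that only some nucleotide mutations change the amino acid
sequence can be illustrated with a short sketch. It is not part of the
specification: the function names are hypothetical, the standard genetic
code is assumed, and the nine-base test sequence is simply the codons
GAC ATT GTG (Asp Ile Val) used for illustration.

```python
# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate an in-frame DNA string into one-letter amino acids."""
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def classify_mutation(dna, position, new_base):
    """Classify a single-base substitution at a 0-based position.

    Returns 'silent' if the protein is unchanged, 'nonsense' if a new
    stop codon appears, and 'missense' otherwise.
    """
    mutated = dna[:position] + new_base + dna[position + 1:]
    before, after = translate(dna), translate(mutated)
    if before == after:
        return "silent"
    if "*" in after and "*" not in before:
        return "nonsense"
    return "missense"


fragment = "GACATTGTG"                         # Asp Ile Val
print(classify_mutation(fragment, 2, "T"))     # GAC -> GAT, still Asp
print(classify_mutation(fragment, 0, "A"))     # GAC -> AAC, Asp -> Asn
```

Whether a missense change also alters conformation or binding cannot be
read from the sequence alone; that still has to be measured.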

This ability will affect not only the therapeutic
format of the antibody but also will enable one to
improve the diagnostic characteristics of the
antibody. For example, one might use the chimeric or
mosaic format to construct an antibody fragment
(e.g., Fab') with different secondary properties but
having the same or improved binding capabilities.

While the importance of the ability to change
the conformation or structure of an immunoglobulin
or to render it less immunogenic is unquestioned,
the ability to make these changes requires
identification of the specific sequence of the antibody in
question. Thus, in order to make a chimeric or
mosaic version of an antibody or to alter its
conformation, the sequence must be known. Once it is
known, improvements in its therapeutic capabilities
then may be made.



Summary of the Invention 

This invention comprises the amino acid sequence
for the monoclonal antibody Anti-Leu 3a
variable region. It further comprises the amino acid
sequence for each of the CDR portions of the
Anti-Leu 3a variable region. A chimeric antibody having
the amino acid sequence for the variable region
and a mosaic antibody having one or more of the
amino acid sequences for the CDR portion of the
variable region also are claimed.

Description of the Drawings 

FIG. 1 comprises a schematic comparison of
the chimeric and mosaic IgG antibodies. Chimeric
antibodies consist of mouse variable regions
and human constant regions.
The mouse variable region contains the CDR (■)
and the FR (□). Mosaic antibodies consist of
synthetic mouse:human variable regions
containing mouse CDRs (■) and human FRs
and human constant regions.

FIG. 2 comprises the nucleotide and
deduced amino acid sequence of the cloned
Anti-Leu 3a light chain variable region gene, 206-Vκ.

FIG. 3 comprises the nucleotide and
deduced amino acid sequence of the cloned Anti-Leu
3a heavy chain variable region gene, 316-VH.

FIG. 4 comprises the nucleotide and
predicted amino acid sequence of the mosaic Anti-Leu
3a light chain variable region gene, KOL/206-Vκ.

FIG. 5 comprises the nucleotide and
predicted amino acid sequence of the mosaic Anti-Leu
3a heavy chain variable region gene, KOL/316-VH.

FIG. 6 comprises a schematic synthesis
strategy for mosaic light and heavy chain variable
regions, wherein FRs are shown by thin boxes,
CDRs as thick boxes, restriction sites by arrows
and overlapping, single-stranded oligonucleotides
are represented by solid lines below each gene.

FIG. 7 comprises partial DNA restriction
maps of human kappa and human gamma 1
expression vectors containing Anti-Leu 3a variable
region sequences wherein exons are indicated by
open boxes, enhancer elements are shown as open
circles, dominant selectable markers are represented
by shaded boxes, and antibiotic resistance
genes are depicted with broken lines.

For FIGS. 2-5, the deduced amino acid sequence
is shown below the nucleotide sequence in
three letter code. The sequence of the mature
protein is capitalized while the leader peptide is in
lower case letters. CDRs are boxed. DNA regulatory
elements (e.g., Parslow box and TATA box)
are highlighted in boldface letters. Donor and
acceptor splice sites are underlined and splice
junctions are indicated by an up arrow.
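
The case convention described for FIGS. 2-5 lends itself to a simple
mechanical split of leader and mature residues. The following is a
minimal sketch, not part of the specification; the function name is
hypothetical and the input is a short illustrative run laid out the way
the figures are, with any residue broken across a line end assumed to
have been rejoined first.

```python
import re


def split_leader_and_mature(three_letter_run):
    """Split a run of three-letter residues into (leader, mature) lists.

    Lower-case residues are treated as leader peptide and capitalized
    residues as mature protein, following the figure convention.
    """
    residues = re.findall(r"[A-Za-z]{3}", three_letter_run)
    leader = [r for r in residues if r.islower()]
    mature = [r for r in residues if r[0].isupper()]
    return leader, mature


# Illustrative run: three leader residues followed by six mature ones.
leader, mature = split_leader_and_mature("serthrglyAspIleValLeuThrGln")
print(leader)
print(mature)
```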



Detailed Description of the Invention 

Organization of the mouse immunoglobulin
genome has been well described by Honjo, Ann.
Rev. Immunol., 1:499 (1983). Briefly, the
immunoglobulin gene system comprises three separate
loci of Lκ, Lλ and H (light and heavy) chain
genes, each chain containing the variable (V) and
constant (C) genes. The Lκ, Lλ and H chain genes
are located on mouse chromosomes 6, 16 and 12
respectively and are located on human
chromosomes 2, 22 and 14 respectively. Within each of
the genes, a variety of introns and exons may
exist.

Referring to FIG. 2, the variable region sequence
for the kappa light chain of Anti-Leu 3a was
determined as follows. DNA containing the mouse
variable region light chain gene was isolated from
clone SK3 by screening a genomic library using
hybridization probes. Genomic DNA was partially
digested with MboI, ligated into the lambda
replacement vectors EMBL3 or EMBL4, packaged,
and amplified on an appropriate E. coli host. The
libraries were screened using a ~0.85 Kb SacI-HindIII
probe from the mouse Jκ-Cκ intron, or a
~0.70 Kb XbaI-EcoRI enhancer probe from the
mouse heavy chain intron. Positive clones were
characterized by Southern blotting, then were
subcloned into the cloning vector pUC13.

Once the DNA for each of the light and heavy
chain variable regions was isolated, it was
sequenced by the chain termination method of Sanger et
al., PNAS, 74:5463 (1977). Appropriate restriction
fragments were subcloned into the sequencing
vectors M13mp18, M13mp19 and pTZ18R, and the
cloning vector pUC13. 35S-labelled templates were
sequenced with Klenow fragment or AMV-reverse
transcriptase.

The nucleotide sequence from positions 538
through 882 comprises the coding region for the
Anti-Leu 3a Vκ chain. The amino acid sequence
corresponding to the nucleotide sequence from
positions 549 through 881 further comprises the
structure of the variable region of the Anti-Leu 3a
kappa light chain.

Referring to FIG. 3, the variable region sequence
for the heavy chain of Anti-Leu 3a was
determined as described above for the light chain
except that appropriate restriction fragments were
subcloned into the sequencing vectors M13mp18
and M13mp19 (Pharmacia), and the cloning vectors
pUC12 and pUC13.

The nucleotide sequence from positions 373
through 738 comprises the coding region for the
Anti-Leu 3a VH chain. The amino acid sequence
corresponding to the nucleotide sequence from
positions 384 through 737 further comprises the
structure of the variable region of the Anti-Leu 3a
heavy chain.
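
The position ranges quoted for the light and heavy chains are 1-based
and inclusive, so their lengths and codon counts follow from simple
arithmetic. The sketch below is illustrative only; the helper names are
hypothetical, and it assumes the quoted spans begin on a codon boundary.

```python
def span_length(first, last):
    """Length in nucleotides of a 1-based, inclusive span."""
    return last - first + 1


def codon_count(first, last):
    """Number of whole codons in a 1-based, inclusive span.

    Raises ValueError if the span is not a multiple of three bases.
    """
    length = span_length(first, last)
    if length % 3:
        raise ValueError(f"span {first}-{last} is {length} nt, not a multiple of 3")
    return length // 3


def to_slice(first, last):
    """Convert a 1-based, inclusive span to a Python slice."""
    return slice(first - 1, last)


# Spans quoted in the text for the amino acid sequences.
print(span_length(549, 881), codon_count(549, 881))  # light chain: 333 nt, 111 codons
print(span_length(384, 737), codon_count(384, 737))  # heavy chain: 354 nt, 118 codons
```

With the figure sequence held in a string `seq`, `seq[to_slice(549, 881)]`
would return the corresponding light chain bases.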

Once isolated, the nucleotide sequences for
each or either of the variable region chains may be
combined with nucleotide sequences for the human
constant region. Once combined, the vector into
which they are combined may be placed in an
expression system for production of the chimeric
antibody. Preferably, the method of Morrison and
Oi as set forth in USSN 644,473 is used to
construct the chimeric antibody; however, the methods
set forth in USSN 483,457 may be used to
construct the chimeric antibody in a prokaryotic
system, such as E. coli.

Briefly, the 206-Vκ and 316-VH variable region
genes were spliced to human kappa and human
gamma 1 constant region genes and were inserted
into the vectors pSV184neo and pSV2ΔHgpt
respectively. The 206-Vκ gene was spliced to the
human κ gene at a unique HindIII site located in
the large intron between the Jκ and Cκ exons. The
316-VH gene was spliced to the human gamma 1
gene at a unique EcoRI site located in the large
intron between the JH and CH1 exons.
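
Splicing at a "unique" site presupposes that the enzyme's recognition
sequence occurs exactly once in the construct. A minimal sketch of that
check follows; it is not the procedure of the specification, the function
names are hypothetical, and the 30-base test sequence is invented for
illustration. The recognition sequences used (AAGCTT for HindIII, GAATTC
for EcoRI) are palindromic, so scanning one strand is sufficient.

```python
RECOGNITION_SITES = {"HindIII": "AAGCTT", "EcoRI": "GAATTC"}


def find_sites(sequence, site):
    """Return 0-based start positions of every occurrence of `site`."""
    sequence = sequence.upper()
    positions = []
    start = sequence.find(site)
    while start != -1:
        positions.append(start)
        start = sequence.find(site, start + 1)  # allow overlapping hits
    return positions


def is_unique_site(sequence, enzyme):
    """True if the enzyme's recognition sequence occurs exactly once."""
    return len(find_sites(sequence, RECOGNITION_SITES[enzyme])) == 1


# Invented test sequence: two HindIII sites flanking one EcoRI site.
toy = "ACGT" + "AAGCTT" + "GGCC" + "GAATTC" + "TT" + "AAGCTT" + "AC"
print(find_sites(toy, RECOGNITION_SITES["HindIII"]))
print(is_unique_site(toy, "EcoRI"), is_unique_site(toy, "HindIII"))
```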

Transfection of a eukaryotic cell line was
accomplished by the method described by Morrison
et al., PNAS, 81:6851 (1984).

One chimeric cell line that resulted from this
transfection was named V23. This cell line
produces a chimeric mouse:human gamma 1
immunoglobulin that binds to the CD4 antigen on
CD4+ cells as confirmed by flow cytometric
analyses and as confirmed by SDS-polyacrylamide gel
electrophoresis of antigen-antibody complexes
from labelled cells.

Referring to FIGS. 4 and 5, the nucleotide sequence
of the Vκ and VH CDRs was determined
next. The mosaic light chain variable region was
synthesized from two 200 base-pair restriction
fragments, each consisting of six oligonucleotides. The
mosaic heavy chain variable region was synthesized
from two 100 base-pair restriction fragments
consisting of four oligonucleotides each and a 200
base-pair restriction fragment consisting of eight
oligonucleotides. See FIG. 6.

Taking advantage of the degeneracy of the
genetic code, unique restriction sites then were
designed into each gene to facilitate CDR
replacement by cassette mutagenesis. Thus, an altered
amino acid sequence could be introduced by
replacing the existing DNA sequence with a synthetic
double-stranded restriction fragment. The human
myeloma cell line KOL was used as a source of
FRs for both the Vκ and VH chains even though it
has a lambda light chain. KOL has been previously
described by Bernstein et al., J. Mol. Biol., 112:535
(1977). The resulting mosaic genes then were
subcloned into the bacterial cloning vectors pTZ18R
and pTZ19R (Pharmacia) and were sequenced on
both strands by the chain termination method
described above.
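
How codon degeneracy allows a restriction site to be placed without
changing the encoded protein can be sketched as a brute-force search over
synonymous codons. This is an illustration only, not the design procedure
actually used; the function names are hypothetical, the standard genetic
code is assumed, and the Lys-Leu example is chosen purely because AAG CTT
happens to spell the HindIII recognition sequence.

```python
from itertools import product

# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Map each amino acid to all codons that encode it.
SYNONYMOUS_CODONS = {}
for codon, residue in CODON_TABLE.items():
    SYNONYMOUS_CODONS.setdefault(residue, []).append(codon)


def translate(dna):
    """Translate an in-frame DNA string into one-letter amino acids."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def encodings_with_site(peptide, site):
    """All DNA encodings of `peptide` (one-letter code) containing `site`.

    Exhaustive, so only practical for short peptides.
    """
    hits = []
    for codons in product(*(SYNONYMOUS_CODONS[r] for r in peptide)):
        dna = "".join(codons)
        if site in dna:
            hits.append(dna)
    return hits


HINDIII = "AAGCTT"
original = "AAACTC"                      # Lys Leu, with no HindIII site
candidates = encodings_with_site(translate(original), HINDIII)
print(candidates)                        # re-encodings that carry the site
```

Every sequence returned translates to the same peptide as the input, so
the site is introduced silently; checking that it is also unique within
the whole gene is a separate step.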

Referring to FIG. 4, the nucleotide sequence
from positions 43 through 387 comprises the mosaic
Vκ chain. The sequences from positions 120-164,
210-230 and 327-353 comprise the CDR1, CDR2
and CDR3 regions respectively. The amino acid
sequence for each CDR then was deduced from
the nucleotide sequence.

Referring to FIG. 5, the nucleotide sequence
from positions 20 through 385 comprises the mosaic
VH chain. The sequences from positions 121-136,
178-228 and 325-351 comprise the CDR1, CDR2
and CDR3 regions respectively. The amino acid
sequence for each CDR then was deduced from
the nucleotide sequence.

As described above, expression vectors then
were made to splice the mosaic constructs to the
respective constant region genes. The vectors
pSV184ΔHneo and pSV2ΔHgpt were used with the
Vκ and VH mosaics respectively. Constant regions
from KOL also were used as previously described.
Each vector was modified by site-directed
mutagenesis to permit insertion of the variable
region mosaics. See FIG. 7.

Transfection of a eukaryotic cell line was
accomplished by the method described in Oi and
Morrison, BioTechniques, 4:214 (1986).

One mosaic cell line that resulted from this
transfection was named 181-21. This cell line
produces a mosaic mouse:human gamma 1
immunoglobulin that binds to the CD4 antigen on
CD4+ cells as confirmed by flow cytometric
analyses and as confirmed by SDS-polyacrylamide gel
electrophoresis of antigen-antibody complexes
from labelled cells.

The V23 and 181-21 cell lines described above
that produce the chimeric and mosaic Anti-Leu 3a
antibodies have been deposited in the laboratory of
Dr. Vernon Oi at the Becton Dickinson Monoclonal
Center, Mountain View, California. It will be
apparent to one skilled in the art that these are not the
only possible combinations of Anti-Leu 3a derived
sequences. Other combinations of sequences
could include 1) having less than all CDRs in the
construct be of mouse origin (e.g., combining
CDR1 in one or both chains from Anti-Leu 3a with
CDR2 and CDR3 from another species), 2) having
only the heavy or light chain be chimeric or mosaic
(and thus having the other chain be totally of
mouse origin) and 3) having a construct comprised
of a mosaic chain and a chimeric chain. Other
constructs may be made of different nucleotide
sequences that have the structure and anti-CD4
binding function of Anti-Leu 3a but do not
necessarily have the entire Anti-Leu 3a sequence.
Still other constructs may represent only a portion
of the antibody (e.g., Fab' fragment). Finally, other
constructs could be made comprising anti-CD4
peptides that have both Anti-Leu 3a like binding
functions (but lack an immunoglobulin structure)
and the sequence of any of the variants described
in FIGS. 2-5.

All publications and patent applications
mentioned in this specification are indicative of the level
of ordinary skill in the art to which this invention
pertains. All publications and patent applications
are herein incorporated by reference to the same
extent as if each individual publication or patent
application was specifically and individually
indicated to be incorporated by reference.

It will be apparent to one of ordinary skill in the
art that many changes and modifications can be
made in the invention without departing from the
spirit or scope of the appended claims.



Claims 

1. A chimeric antibody having a human constant
region and having a mouse variable region
comprising an amino acid sequence for a Vκ chain
as described in FIG. 2.

2. A chimeric antibody having a human constant
region and having a mouse variable region
comprising an amino acid sequence for a VH chain
as described in FIG. 3.

3. A chimeric antibody having a human constant
region, and having a mouse variable region
comprising an amino acid sequence for a Vκ chain
as described in FIG. 2 and an amino acid sequence
for a VH chain as described in FIG. 3.

4. An antibody having a mouse:human mosaic
variable region comprising an amino acid sequence
for a Vκ chain as described in FIG. 4.

5. An antibody having a mouse:human mosaic
variable region comprising an amino acid sequence
for a VH chain as described in FIG. 5.






EP 0 365 209 A2



6. An antibody having a mouse:human mosaic
variable region comprising an amino acid sequence
for a Vκ chain as described in FIG. 4 and an amino
acid sequence for a VH chain as described in FIG.
5.

7. A chimeric antibody having a human constant
region, and having a mouse:human mosaic
variable region comprising an amino acid sequence
for a Vκ chain as described in FIG. 4 and having
an amino acid sequence for a VH chain as
described in FIG. 5.

8. An antibody having a mouse:human mosaic
variable region comprising at least one of the CDR
amino acid sequences as described in FIG. 5.

9. An antibody having a mouse:human mosaic
variable region comprising at least one of the CDR
amino acid sequences as described in FIG. 6.

10. A nucleotide sequence for an anti-CD4
agent comprising any of the nucleotide sequences
as described in any of FIGS. 2-5.
10 20 30 40 50 60

TTTAAATCAT«CTTTirrAA(;t:CTTAAATAAACACAAATAAACATTCACCCACAAAACAAA 

70 80 90 100 110 120

AAAAAAAAACACACATnTnATCCTCTACTAAATTTrCTGCTCA(:ACCAACTCCCCATt-GG 

130 140 150 160 170 180

TGAAACACCACCACGTCCCCTTCCACCAACACCACATACTCTCCTCATTTGCATATCAAA 

PARSLOW

190 200 210 220 230 240 

TAATTTTATAACAGCCCACGCTTCTTTAAGCGCAGCTGCCACGAGCCTAACAACCATCCT 

TATA 

250 260 270 280 290 300

CTCATCTAGTTCTCACACATCGAGACAGACACAATCCTCCTATGGGTGCTCCTCCTCTCC 
metgluthraspthrileieuleutrpvalleuleuleutrp 

310 320 330 340 350 360 

CTTC CAGGTGAGAGTCC AG AG A AGTGTTGC G AG C A ACCTCTGCG ACC ATC ATG ACTTTCC 
valprofit 

370 380 390 400 410 420 

ATGCATATGGACTCCTGAATCTTATAATTAATCCATTTGTAATTGGTTITAACTTTCCTG 

430 440 450 460 470 480 

ATTCCCTTTC AGTTCCTGATGTCTC ATATTG A T GTC C ACA AC ATTCTTTAT ATTTTTAAA 

490 500 510 520 530 540 

TGAAATCGGAAGTCCTTTATACATATATAACAATTGTCTGTGTGTT TATCATTCCAGGC T 

jLys 

550 560 570 580 590 600 

CCACTGGTGACATTGTGCTGACCCAATCTCCAGCTTCTTTGGCTGTGTCTCTAGGGCAOA 
erthrglyAspIleValLeuThrGlnSerProAlaSerLeuAlaValSerLeuGlyGlnA 



610 620 630 640 650 660

CCGCCACCATCTCCTCC^ACGCCACCCAAAGTGTTCATTATGATGGTGATAGTTATATGA 
^^AlAThrTl^SerCYs lLvsAlaSerGlnSerValAspTyrAsp GlYAsDSerTyrMetA 

670 680 690 700 710 720



AcTrCGTACCAACAGAAACCAGGACAGCCACCCAAACTCCTCATCTAT 3CTGCATCCAATC 
sjrrpTyrGlnClnLysProGlyGlnProProLysLeuLeuIleTyr^laAlaSerAsnL 



730 740 750 760 770 780 

TACAATCrifccCATCCCAGCCACATTTACTCCCAGTCCGTCTCCGACAGACTTCACCCTCA 
euGluSer31yIleProAlaArgPheThrClySerGlySerGlyThrAspPheTbrUeuA 



790 800 810 820 830 840



ACATCCATCCTGTGCACGAGGAGGATACTCCAACCTATTACTGTtAACAAACTTATCACC 
snlleHtsProValCluCluGluAspThrAlaThrTyrTyrCyshlnGlnSerTyrCluA 



850 860 870 880 890 900 

ATCCTCCCACA|rTCGCTGCAGGCACCAACCTGCAAATCAAGCGTAAGTACAATCCAAACT 

spProProThiJ>heAloGlyGIyThrAsnLeuGluIleLysAl 
10 20 30 40 50 60

(;aattcacaaaaa(;tttatg(*catacatttcci"Ca(IacaccaataC(iatttt(jACGtcai; 

70 80 90 100 110 120

CATCCTGCTtii^rTGACCCATGTCATCACACrrTC.-TTCACCTCCACTAGGTCCTTATCTAAC 

130 140 150 160 170 180

AAATACACCCCTCATCAATATGTAAATCACCCGAGTCTATGGCAGGTAATACTGCCATGT 

PARSLOW TATA

190 200 210 220 230 240 

CCACACCATCAAAACAACCTATCATCAGTGTCATCTCCACAGTCCCTCAACACACTGACT 

250 260 270 280 290 300 

CTAACCATCGAATGGAGGATCTTTCTCTTCATCCTGTCAGGAACTG CAGGTAAGGG GCTC 

metglutrpargilepheleupheileleuserglythralagj 

310 320 330 340 350 360

ACCACTTCCAAATCTGAAGTCGAGACAGGACCTGAGGTCACAATGACTTCTACTCTGCCT 

370 380 390 400 410 420 

T TCTCTCCACAGGT GTCCACTCCCAGGTTCAGCTGCAGCAGTCTGGACCTGAGCTGCTGA 
(LyvalhisserGlnValGlnLeuGlnGlnSerClyProGluLeuValL 

430 440 450 460 470 480 

AGCCTGGGGCTTCAGTGAAGATGTCCTGCAAGGCTTCTGGATACACATTCACT OACTATG 
ysProGlyAlaSerValLysMetSerCysLysAlaSerGlyTyrThrPheThriAspJ^ 

490 500 510 520 530 540 

TTATAAACh*GGGTGAAGCAGAGAACTGGACAGGGCCTTGAGTCGATTGGA 3AGACTTATA 
alIleA3i^ rpValLysGlnArgThrGlyGlnGlyLeuGluTrpIleGly )3luThrT y rT 

550 560 570 580 590 600 

CTGGAAGTGGCAGTAGTTATTACAATGAGAAGTTCAACGACkAGGCCACACTGACTGTAG 



hrGlySerGlySerSerTyrTyrAsnGluLysPheLysAsij LysAlaThrLeuThrValA 

610 620 630 640 650 660 

ACAAAGCCTCCAATATAGCCTACATGCAGCTCAGCAGCCTGACATCTCACGACTCTCCGG 
spLysAlaSerAsnlleAlaTyrMetGlnLeuSerSerLeuThrSerGluAspSerAlaV 

670 680 690 700 710 720 

TCTATTTCTGTCCAACAbGGCGTAAACGAACCGGGTTTUCl 1 1 llrGGGCCCAACGGACTC 



730 740 750 760

TGGTCACTGTCTCTGCA^GTGAGTCCTAACTTCTCCCATTCTAGA 

euValThrValSerAlaG{ 
10 20 30 40 50 60

CAATTrTCTnTt;TAACTATGTATTTCTCTCTCATTCTTTCACCTTCCACCAGTCAAACCC 

paserscrserCluSerV 

70 80 90 100 110 120